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TESTS  FOR  EXFONENTIALITT  WHEN  ORIGIN  AND  SCALE 


PARAMETERS  ARE  UNKNOUN 

By 

John  Spinelll  and 
Michael  A.  Stephens 


1.  INTRODUCTION 

A  test  of  exponent iallty  Is  a  test  of  a  given  random  sample 

of  n  values  of  x  comes  from  the  exponential  distribution 

(1)  F(x)  -  1  -  exp(-(x-A)/B),  X  >  A  , 

where  B  Is  a  positive  constant.  This  distribution  will  be  referred 
to  as  exp(A,B),  and  A  Is  the  origin  of  the  distribution.  Through¬ 
out  the  paper  we  shall  suppose  the  sample  has  been  labelled  In  ascending 

order,  so  that  A<x,  <x„<**»<x. 

X  z  n 

Many  tests  of  the  above  null  hypothesis  have  been  proposed 

In  the  literature;  In  the  great  majority  of  these  tests,  B  Is  unknown, 
and  A  Is  known.  Then  without  loss  of  generality  A  can  be  assumed 
zero;  If  A  Is  not  zero  the  substitution  y^  *  x^-A  produces  a 
sample  of  ordered  y-values  from  exp(0,B)  when  is  true.  The 
parameter  B  Is  either  estimated  directly  from  the  sample  or  Is 
eliminated  by  exploiting  connections  between  the  exponential  and  the 
uniform  distributions. 

In  this  paper  we  discuss  tests  when  both  A  and  B  are  unknown. 
These  tests  have  so  far  not  been  considered  extensively,  and  a  possible 
reason  for  this  Is  because  a  simple  device  exists  to  change  a  test  of 


H  with  A  unknown  into  a  test  with  A  ~  0.  This  is  to  make  the 
o 

substitution  ^  *  l,...(n-l;  it  is  a  well  known  result 

that  this  substitution  produces,  on  R^,  an  ordered  random  sample  of 

n-1  values  of  v  from  exp(0,B).  Thus  the  test  of  can  be  reduced 

to  a  test  that  the  come  from  exp(0,B),  B  unknown,  using  any  of 

the  many  tests  of  this  hypothesis  that  have  been  proposed.  This  method 

of  dealing  with  the  unknown  A  we  shall  call  Method  1.  Although 

Method  1  eliminates  A,  it  may  not  necessarily  give  the  most  powerful 

test  of  H  ,  and  we  therefore  investigate  two  main  alternative 
o 

approaches,  in  which  A  is  estimated  directly  rather  than  eliminated. 
The  first  of  these  alternatives,  which  we  shall  call  Method  2,  is  to 
estimate  A  and  B,  and  then  to  use  tests  based  on  the  empirical  distri¬ 
bution  function  (EDF).  A  second  alternative,  called  Method  3,  is  to 
use  a  test  based  on  a  regression  of  the  order  statistics  x^, . . . ,x^ 
on  suitable  constants  along  the  horizontal  axis.  For  example,  the 
order  statistics  may  be  regressed  against  m^  where  m^  is  the 
expected  value  of  the  1-th  order  statistic  from  exp(0,l).  On  H^, 
we  have 

(2)  -  A  +  Bm^ 

where  E  denotes  expectation,  and  suitable  test  statistics  are  measures 
of  how  well  the  order  statistics  fit  the  line  (2).  One  such  statistic, 
for  example,  is  the  Shaplro-Wllk  (1972)  statistic  W^,  and  other 
statistics  recently  proposed  in  the  literature  are  measures  of  the 
correlation  between  x.  and  m. ,  or  between  x.  and  statistics 


2.  EDF  STATISTICS 


EDF  statistics  ace  statistics  based  on  the  discrepancy  between 
the  distribution  (1)  above,  with  estimates  used  for  A  and  B,  and 
the  eiq>lrlcal  distribution  function  of  the  sample  of  x-values.  Many 
statistics  have  been  proposed  to  measure  this  discrepancy,  and  we 
here  concentrate  on  those  which  are  usually  called  D**,  D  ,  D,  V, 

W^,  and  A^ .  The  formal  procedure  for  calculating  these  statis¬ 
tics  is  as  follows: 

(a)  Calculate  the  following  estimates  of  A  and  B: 

A  At  ^  ^  H 

B  ■  n(x-x.)/(n-l)  and  A  ■  x.  -B/n,  where  x  «  ^  x./n. 

^  ^  1-1  ^ 


(b)  Calculate 

As  A 

w^  “  (x^-A)/B  and  •  l-expC-Wj^)»  for  i  -  l,...,n. 


Cc)  The  test  statistics  are  then  given  by  the  following  formulas: 


"“KKn 


•  "“kk,  >*i- 


D  -  iiiax(D^,D  ) 


V  -  D  +  D 
n 


„2  .  f  ,  _  2i:ix‘  .  _l_ 

^  1-1  '  ^  ^  ^ 

9  2  -  1  2  _  5 

-  w  -n(z-«)  ,  where  z  -  i  z./n 

^  1-1  ^ 

9  1  “ 

A  "  ‘  t  I  (2i-l)(ln  z^  +  ln(l-z^_^j^_^)  -n. 


The  estimates  A  and  B  given  above  are  derived  from  generalized 

least  squares;  they  can  be  obtained  also  from  the  maximum  likelihood 

estimates,  modified  to  make  them  unbiased.  The  distributions  of  the 

test  statistics,  on  H^,  will  depend  on  n  but  not  on  the  true  values 

of  A  and  B.  These  distributions  are,  for  finite  n,  difficult  to 

2  2 

find,  but  asymptotic  theory  Is  available  for  the  statistics  W  ,  U 
2 

and  A  .  The  asymptotic  distributions  are  the  same  as  those  tabulated 
under  the  heading  Case  4  by  Stephens  (1974,  1976);  Case  4  refers  to 
the  situation  In  which  A  Is  zero  and  only  B  Is  estimated  by  maximum 

/s. 

likelihood.  The  Introduction  of  the  new  estimate  A,  in  addition  to 

>v  /V 

B,  does  not  effect  the  asymptotic  distribution,  because  A  has 
variance  of  the  order  1/n  ,  and  so  Is  super-efficient  In  the  nota¬ 
tion  of  Darling  (1955).  However,  for  finite  n  there  will  be  some 
difference  In  the  null  distributions,  and  these  have  been  found  by 
Monte  Carlo  methods.  A  list  of  upper  tail  percentage  points  Is  given 

in  Tables  1  and  2.  The  slight  difference  In  values  of  the  asymptotic 

2  2  2 

distributions  for  W  ,  U  and  A  represents  an  improvement  in  the 
calculations  of  these  points  since  the  appearance  of  Stephens  (1974), 
although  use  of  the  earlier  points  makes  only  a  very  small  difference 
in  a-values.  has  also  been  examined  by  Van  Soest  (1969),  who 

gave  asymptotic  theory  and  Monte  Carlo  points  for  n  -  10  and  20.  For 
/nD*^,  /nD~,  and  /nD  the  asymptotic  points  have  been  found  by  extra¬ 
polating  exact  points  given  by  IXirbln  (1975)  for  Case  4. 

The  test  of  fit  therefore  consists  of  calculating  the  appropriate 
statistic  required,  as  described  above,  and  referring  either  to  Table  1 
for  direct  percentage  points,  or  calculating  the  modified  form  of  the 


test  statistic  as  given  in  Table  2  and  referring  to  the  asymptotic 
points  only.  The  modifications  are  very  useful  when  making  a 
computer  program  for  the  tests. 

Example.  We  illustrate  the  procedure  on  the  set  of  data,  kindly 
provided  by  Dr.  W.G.  Warren,  listed  in  Table  3.  There  are  n  *  32 
observations,  measurements  of  modulus  of  rupture  of  wood  beams.  From 

_  A  A 

the  data  we  have  x  <■  94.878;  then  B  >  53.356  and  A  >  41.523.  The 
EDF  statistics  have  values  /nD^  *  1.115,  /nD  >  1.989,  /nD  ■  1.989, 
/nV  -  3.104,  W^  -  1.066,  -  0.780,  -  5.097.  Reference  to  Tables 

1  and  2  show  all  these  statistics  to  be  highly  significant  and  we 
reject  the  null  hypothesis  that  the  values  are  exponentially  distri¬ 
buted. 

Comment.  In  other  goodness-of-fit  tests  based  on  the  EDF,  simple 
maximum  likelihood  estimators  have  usually  been  used,  even  when  these 

A 

are  biased.  In  this  case  the  maximum  likelihood  estimator  of  A  would 

be  Xj^;  the  first  value  of  w^  would  then  be  0,  and  the  first 

value  of  would  be  0  also.  This  would  produce  an  infinite 

2 

value  for  the  statistic  A  ,  so  that  this  statistic  could  not  be 

used.  The  change  would  of  course  also  affect  the  Monte  Carlo  points 

for  finite  n  for  the  other  statistics,  although  the  asymptotic 

2 

distribution  would  remain  unchanged.  However,  since  A  has  frequen¬ 
tly  been  found  to  be  a  powerful  statistic  in  goodness-of-f it,  the 

2 

unbiased  estimate  of  A  has  been  used  and  statistic  A  Is  then 
available  for  test  purposes. 


3.  REGRESSION  METHODS 


Suppose  1  -  Is  an  ordered  random  sample  from  exp(0,l) 

and  let  m^  ■  E(u^);  then  E(xp  •  A4Bm^  as  In  (2)  above.  Test 

statistics  of  fit  can  then  be  based  on  how  closely  the  data  fit 

this  model.  A  possible  test  statistic  is  the  correlation  coefficient 

r(x,m)  calculated  from  the  pairs  (x^,m^),  1  >  l,...,n.  Strictly 

speaking  this  Is  not  a  correlation  coefficient,  since  the  m^  are 

constants  and  not  random  variables,  but  It  has  become  customary  to 

use  this  terminology  since  r(x,m)  is  calculated  from  the  usual 

formula  for  paired  random  variables,  as  follows. 

Let  S^  »  Z.Cx  -x)  ,  S  -  Z  (x  -x)(m  -m)  and  S  -  Z.(m  -m)  , 
XX  1  1  xn  11  1  nm  x  i 

where  x  and  m  are  the  means  of  the  x-values  and  the  m-values 

2  2 

respectively.  Then  r  (x,m)  -  S__  /(S  S _ ).  An  alternative  method 

XB  XX  m 

of  plotting  Is  to  put  h^  ■  -ln{l-i/(n+l) J  along  the  x-axls.  Instead 
of  m^:  h^  Is  a  good  approximation  to  m^  and  is  easier  to  calculate. 
The  corresponding  correlation  coefficient  will  be  called  r(x,h).  For 
both  these  statistics  a  high  value  (that  Is,  approaching  1)  will  suggest 
a  good  fit  to  the  exponential  model. 

It  Is  comaon  in  regression  to  set  up  an  Analysis  of  Variance  table 
with  SS  denoting  Sum  of  Squares: 


Source  of  variation 
2 

Regression  SS  «  S  /S 

xm  mm 

2 

Error  SS  ■  S_-S 

XX  xm 

Total  SS  ■  S 


3CK 


7 


The  Einor  SS  may  also  be  written  Z(x--x.)  ,  where  x.  -  A  +  Bm. 

£2-1  1  1 

and  is  clearly  also  a  measure  of  how  well  the  data  fits  the  estimated 


line  (2);  in  fact  Error  SS/Total  SS  T  <■  l-r^(x,m)  can  be  used  as 


the  test  statistic,  with  high  values  of  T  leading  to  rejection  of 

m 


The  parallel  statistic  T^  -  l-r^(x,h)  could  be  used  also;  some 


critical  values  for  use  with  censored  samples  have  been  given  by  Smith 

and  Bain  (1976,  Table  III).  These  were  found  by  Honte  Carlo  sampling. 

In  Table  4  of  this  paper  are  given  critical  values  for  T  ,  and  further 

flu 

values  for  T^.  These  are  based  on  Monte  Carlo  samples,  using  10,000 
samples  for  each  n.  For  Table  III  of  Smith  and  Bain  (1976)  the  number 
of  Monte  Carlo  samples  was  100,000/n,  so  that  the  values  in  Table  4 
for  n  >  10  might  be  expected  to  be  more  accurate. 


Example.  For  the  example  quoted  In  Section  2,  we  have  r^(x,m)  >  0.640 


and  r^(x,h)  «  0.611;  the  corresponding  values  of  T  and  T.  are  0.360 

m  n 


and  0.389.  Reference  to  Table  4  shows  these  statistics  to  be  clearly 
slf  f leant  at  level  0.01;  thus  the  hypothesis  that  these  data  are 
exponential  Is  rejected. 

An  alternative  method  of  measuring  the  goodness-of-f it  was  intro¬ 
duced  by  Shapiro  and  Wllk  (1972) ,  and  consists  of  comparing  the  generalized 
least  squares  estimate  of  B  given  from  the  linear  model  (2)  with  the 
estimate  obtained  from  the  sample  variance.  The  resulting  test  statistic 
Ug  Is  given  by 

2 


E  ,  ,.2 


(n-l)S 


xx 


Tables  of  percentage  points  for  W  ,  for  n  from  3  to  100,  have 


been  given  by  Shapiro  and  Wllk  (1972).  The  test  Is  a  two-talled  test. 
The  value  of  Is  0.1742,  and  reference  to  Table  1  of  Shapiro  and 

Wllk  (1972)  shows  this  to  be  highly  significant  In  the  lower  tall,  so 
again  will  be  rejected. 

There  Is  a  weakness  In  connection  with  W^.  It  Is  possible  to 

find  other  distributions  for  which  the  limit  of  {x-Xp^}/S^ ,  as 

n-»-«>.  Is  the  same  as  that  for  the  exponential,  i.e.  1,  and  the 

test  based  on  W-  will  not  necessarily  be  consistent;  this  was  noted 
£ 

by  Sarkadl  (1975).  A  distribution  with  this  property,  for  example.  Is 

the  Beta  (a,  b)  distribution,  with  a  <  1  and  b  >  a(a4-l)/(l-a) ;  two 

pairs  (a,b)  are  (1/4,5/12)  and  (1/2, 3/2).  Monte  Carlo  Samples 

of  size  n  were  taken  from  Beta (1/4, 5/12)  and  the  percentage  of  1000 

samples  rejected  by  W_  in  a  test  for  exponent iallty  at  the  10%  level 

%rere  recorded.  The  results  were,  for  n  ■  10,  15.8%;  for  n  ■  20,  9.4%; 

for  n  -  50,  7.2%;  and  for  n  -  100,  2.2%.  Corresponding  percentages  for 

the  statistic  were  61.9%,  92.4%,  100.0%  100.0%.  For  the  same 

sample  sizes,  and  for  samples  from  the  Beta  (1/2, 3/2)  distribution, 

the  percentages  rejected  were,  for  W^,  7.7,  5.8,  4.4,  1.5;  and  for 
2 

A  ,  22.1,  38.2,  73.1,  96.4.  The  inconsistency  of  Is  thus  apparent 

2 

In  contrast,  Gerlach  (1979)  has  shown  that  r  (x,m)  (and  hence  T  ) 

HI 

always  gives  a  consistent  test  and  this  would  appear  to  be  true  for 
2 

r  (x,h)  and  T.  also. 


4.  POWER  STUDIES 


The  efficiencies  of  the  above  methods  of  dealing  with  unknown  A 
have  been  examined  by  extensive  Monte  Carlo  studies.  Random  spaces 
of  size  n  were  taken  from  a  broad  range  of  distributions  including 
especially  those  distributions, such  as  the  Weibull,  ganma  and  lognormal 
distributions,  which  are  used  in  the  literature  as  practical  alternatives 
to  the  exponential  in  models  for  reliability  studies  or  renewal  processes 
These  were  then  tested  for  exponentlallty  using  the  several  methods 
described  above,  and  tables  prepared  showing  the  percentage  of  samples 
declared  significant  when  the  test  was  of  size  a,.  It  was  noted  by 
Stephens  (1974)  and  again  by  Dyer  (1974)  that  knowledge  of  the  exact 
values  of  parameters  (in  the  context  of  testing  for  normality)  does 
not  always  help  in  deciding  on  the  distributional  form;  the  power  of 
the  test  against  densities  of  different  shape  is  greater  when  the 
statistician  estimates  his  own  parameters.  For  this  reason,  tables 
were  prepared  to  compare  the  results  for  A  unknown,  with  those 
obtained  when  A  was  assumed  known  (Case  4  above) .  Powers  for  A 
known  are  given  first,  in  Table  5.  They  agree  within  sampling  fluctua¬ 
tions  with  previous  values  given  in  an  unpublished  technical  report 
by  Stephens  (1978).  Table  6  shows  the  results  for  Method  1  (subtract¬ 
ing  the  minimum  and  treating  as  Case  4) ,  Table  7  gives  results  for  EDF 
statistics  with  A  and  B  estimated  and  Table  8  compares  the  best 

of  the  EDF  statistics  with  the  Shaplro-Wllk  and  with  the  correlation 

2 

statistics  T^  and  T^,  respectively  equivalent  to  r  (x,m)  and 
2 

r  (x,h).  Van  Soest  (1969)  also  gives  some  power  studies  against  Gamma 
alternatives,  and  our  results  agree  where  they  overlap. 


Comments,  (a)  There  Is,  overall,  a  reduction  of  power  for  EDF 


statistics  when  A  must  be  estimated.  In  contrast  to  the  situation 
in  testing  for  normality  using  these  statistics  (Stephens,  1974; 

Dyer,  1974). 

2 

(b)  Of  the  EDF  statistics,  A  is  best  on  the  whole,  for  both 

methods  1  and  3,  though  is  close  behind.  D^  or  D  will 

2 

usually  be  more  powerful  than  A  ,  but  their  limitation  Is  that 
one  must  know  which  of  the  two  should  be  used. 

(c)  In  a  comparison  of  methods  1  and  2,  both  of  which  use  EDF 

2  2 
statistics,  A  with  method  2  Is  overall  slightly  better  than  A 

with  method  1.  When  there  Is  a  very  high  probability  of  small  values 
2 

(e.g.  for  Welbull  (0.5)),  method  1  Is  better.  These  differ¬ 

ences  extend  to  the  other  statistics  also. 

2  2 

(d)  From  Table  8,  where  A  and  W  with  method  2  are  compared 

with  the  regression  statistics.  It  can  be  seen  that  the  simple  regres¬ 
sion  statistics  T^  and  T^  are  poor  In  power,  particularly  for  large 
sample  sizes.  T^^  Is  mostly  worse  than  T^. 

(e)  Statistics  W^,  and  or  A^,  are  competitive  with  each 

other,  with  sometimes  the  former  statistic  and  sometimes  the  latter 
pair  being  more  powerful.  There  are  Indications  that  for  some  alterna¬ 
tives,  e.g.  Beta  (1,4),  the  half-nonsal  and  the  log-normal  distributions. 

Improves  with  large  samples.  But  the  serious  drawback  that  is 

Inconsistent,  and  so  will  rarely  detect  certain  distributions  even  with 
large  samples,  will  put  Wg  out  of  favour  with  many  statisticians. 


(f)  If  D  ,  D~,  or  Wg  are  to  be  used,  some  Indication  of 
the  nature  of  the  alternative  would  have  to  be  available.  Of  course, 
and  D  can  both  be  calculated,  and  this  Is  a  useful  general  pro¬ 
cedure  towards  an  analysis  of  the  data;  but  if  then  the  most  signifi¬ 
cant  statistic  Is  chosen  for  a  test,  there  will  be  an  unknown  change 
In  the  true  significance  level.  In  view  of  these  reservations  Involving 

the  other  statistics,  our  overall  recommendation  would  therefore  be 

2 

to  use  method  2  with  A  ,  as  described  In  Section  2,  for  an  overall 
omnibus  statistic  for  this  problem. 
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TABLE  1 

Upper  tell  percentage  points  for  /nD^ ,  /nD  ,  /nD,  /nV,  , 

and  A^,  for  a  teat  of  exponent lality  with  location  and  scale  para 

meters  estimated  as  in  Section  2* 
a)  Scaclscic  VnD^ 

Up|H‘r  cnil  significance  level  a 


TAKI.I-:  1  Continued 


.25 

.  15 

.  10 

.05 

.025 

.01 

d)  StaCistii-  ViiV 

5 

1.098 

1. 1H6 

1.234 

1.314 

1 . 400 

1.494 

10 

1.194 

1.294 

1.363 

1.461 

1.556 

1,662 

15 

1.225 

1.325 

1.392 

1.504 

1.596 

1.701 

20 

1.245 

1.34(. 

1.419 

1.536 

1.635 

1.  769 

25 

1 . 2()0 

1 . 366 

1.438 

1.559 

1.658 

1.796 

50 

1.292 

1.400 

1.481 

1.600 

1.701 

1.847 

100 

1.310 

1.419 

1 . 502 

1.647 

1.740 

1.897 

fJTI 

1.334 

1.444 

1.532 

1.656 

1.770 

1.910 

•» 

e)  Statistic  W“ 

5 

.083 

.  10? 

.117 

.141 

.  166 

.  197 

10 

.097 

.  122 

.  142 

.176 

.211 

.259 

15 

.103 

.  130 

.151 

.  188 

.229 

.281 

20 

.  lOA 

.133 

.157 

.  195 

.2  37 

.293 

25 

.  107 

.135 

.  160 

.  199 

.24  7 

.301 

50 

.111 

.  141 

.  166 

.209 

.256 

.319 

100 

.113 

.  144 

.170 

.215 

.263 

.328 

» 

.116 

.148 

.175 

.222 

.271 

.338 

f)  Statistic 

5 

.068 

.083 

.093 

.113 

.  131 

.153 

10 

.075 

.094 

.108 

.131 

.155 

.187 

15 

.080 

.099 

.114 

.139 

.165 

.200 

20 

.082 

.102 

.117 

.143 

.170 

.207 

25 

.083 

.  104 

.119 

.146 

.173 

.212 

50 

.087 

.108 

.124 

.152 

.180 

.223 

100 

.089 

.110 

.126 

.155 

.184 

.229 

CD 

.090 

.112 

.129 

.159 

.189 

.236 

2 

g)  Statistic  A 

5 

.460 

.555 

.621 

.725 

.848 

.989 

10 

.545 

.660 

.747 

.920 

1.068 

1.352 

15 

.575 

.720 

.816 

1.009 

1.198 

1.495 

20 

.608 

.757 

.861 

1.062 

1.267 

1.580 

25 

.625 

.784 

.890 

1.097 

1.317 

1.635 

50 

.680 

.838 

.965 

1.197 

1.440 

1.775 

100 

.710 

.875 

1.008 

1.250 

1.510 

1.855 

m 

.736 

.916 

1.062 

1.321 

1.591 

1.959 

TABLK  2 


2  2  2 

Modifications  and  percentage  points  For  the  modified  statistics  W  ,  U  ,  and  A  . 

Upper  tall  significance  level  a 

Statistic  Modification  .25  .15  .10  .05  .025  .01 


TABLE  3 


Thirty-two  ordered  values  of  modulus  of  rupture  for  Douglas  Fir 
and  Larch  two-by-foura. 


43.19 

90.63 

100.22 

109.62 


49.44 

92.45 
103.48 
110.81 


51.55 

94.24 

105.54 

112.75 


55.37 

94.35 

105.54 

113.64 


56.63 

94.38 

107.13 

116.39 


67.27 

98.21 

108.14 

119.46 


78.47 

98.39 

108.64 

120.33 


86.59 

99.74 

108.94 

131.57 


Values  and  significance  levels  of  the  regression  and  EDF  test 
statistics  for  exponentiallcy  calculated  on  the  above  data. 


Statistic 

value 

VnD'*’ 

1.115 

VnD- 

1.989 

VnD 

1.989 

VnV 

3.104 

w2 

1.066 

0.780 

5.097 

r^(x,«) 

0.640 

r^(x,h) 

0.611 

«E 

0.174 

significance  level  a 

<.025 

<.01 

<.01 

<.01 

<.01 

<.01 

<.01 

<.01 

<.01 

<.01 
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TABLE  4 


Percencage  points  for 


l-r^(x,«)  and  for  -  l-r^(x,h) 


Statistic  T 

n 


upper  tail  c- level 


n 

.25 

.15 

.10 

.05 

.025 

.01 

5 

.140 

.178 

.201 

.261 

.328 

.392 

10 

.104 

.133 

.156 

.192 

.232 

.272 

15 

.087 

.110 

.129 

.163 

.194 

.232 

20 

.076 

.096 

.113 

.141 

.169 

.210 

25 

.065 

.083 

.097 

.123 

.152 

.189 

50 

.044 

.056 

.067 

.085 

.108 

.143 

100 

.028 

.036 

.043 

.057 

.074 

.105 

«0 

.000 

.000 

.000 

.000 

jooo 

.000 

Statistic  T. 

n 


5 

.136 

.167 

.192 

.235 

.285 

.352 

10 

.102 

.129 

.154 

.199 

.234 

.285 

15 

.084 

.110 

.132 

.170 

.207 

.259 

20 

.073 

.097 

.116 

.152 

.191 

.241 

25 

.064 

.084 

.102 

.135 

.175 

.223 

50 

.044 

.060 

.073 

.101 

.133 

.176 

100 

.029 

.040 

.050 

.069 

.094 

.129 

«0 

.000 

<000 

.000 

.000 

<000 

.000 
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TABLE  5 


Power  Conpnrlsuns:  A  known 


The  cable  shows  the  pcrccntaKu  of  2500  samples  declared  significant 
by  Che  statistic,  when  the  lest  used  was  .*i  10%  test. 


Alternative  Sample 
distr.  size 

n" 

1) 

V 

Beta  (1,4) 

10 

3.6 

16.8 

11.7 

11.4 

1.3.0 

13.5 

10.6 

20 

2.5 

22.7 

15.4 

14.4 

17.7 

15.7 

14.4 

50 

l.l 

39.5 

27.4 

23.6 

31.2 

24.4 

27.8 

Chi-square  1 

10 

54.0 

1.7 

33.6 

30.9 

38.1 

33.6 

56.2 

20 

72.2 

0.7 

57.2 

50.4 

62.6 

53.8 

71.7 

50 

96.5 

1.0 

91.4 

87.3 

94.5 

89.0 

98.3 

Chi-square  4 

10 

1.2 

43.3 

32.5 

29.7 

37.9 

34.4 

31.7 

20 

1.3 

68.7 

56.1 

50.8 

63.7 

55.5 

61.6 

50 

4.5 

95.8 

90.5 

86.7 

95.0 

90.6 

96.3 

Half-cauchy 

10 

53.9 

2.8 

42.7 

36.7 

45.1 

37.8 

46.0 

20 

75. fl 

0.8 

65.6 

57.7 

68.1 

59.2 

68.5 

50 

95,4 

0.0 

92.8 

88.5 

94.2 

89.1 

93.9 

Half-normal 

10 

1.6 

26.2 

19.7 

19.0 

21.2 

20.4 

17.0 

20 

0.8 

38.6 

27.0 

24.8 

31.7 

27.2 

27.4 

50 

1.0 

66.5 

54.4 

48.7 

62.1 

51.6 

58.8 

Log-normal (1) 

10 

16.0 

14.4 

16.6 

16.3 

17.4 

17.5 

14.9 

20 

22.5 

15.6 

22.5 

23.2 

25.6 

26.5 

24.8 

50 

36.0 

23.8 

36.7 

48.4 

43.3 

51.2 

50.5 

Log-no rm(2. 4) 

10 

91.2 

0.2 

83.2 

78.3 

86.0 

79.8 

89.8 

20 

99.6 

0.0 

98.4 

97.0 

99.1 

97.0 

99.4 

SO 

100.0 

0.1 

100.0 

100.0 

100.0 

100.0 

100.0 

Uniform 

10 

3.4 

53.1 

42.2 

47.9 

51.5 

48.0 

44.7 

20 

17.6 

80.0 

69.0 

78.5 

81.2 

74.7 

77.5 

SO 

84.9 

98.8 

97.0 

99.7 

99.4 

98.6 

99.8 

Weibull  (0.5) 

10 

79.4 

0.3 

63.8 

57.1 

68.9 

59.1 

81.1 

20 

95.2 

0.0 

91.0 

84.1 

92.2 

86.2 

96.8 

SO 

100.0 

0.5 

100.0 

99.9 

100.0 

99.9 

100.0 

Weibull  (2.0) 

10 

2.3 

75.7 

65.8 

64.1 

76.2 

69.3 

69.6 

20 

9.8 

96.4 

91.8 

91.8 

97.0 

93.8 

96.6 

SO 

49.8 

100.0 

100.0 

100.0 

100.0 

100.0 

100.0 

I 


TABLE  6 


PowMT  OoaparlsoBS  using  Hsthod  1  Csubtxacttng  fron  all  others). 

The  table  shotrs  the  percentage  of  2500  samples  declared  significant 
by  the  statistic,  when  the  test  used  was  a  10%  test. 


Alternative 

Sample 

4- 

9 

dlstr. 

size 

n 

d" 

D 

V 

Beta  (1,4) 

10 

4.4 

16.8 

12.5 

12.0 

13.8 

13.1 

11.6 

20 

2.9 

22.4 

14.8 

14.6 

17.4 

15.0 

14.8 

50 

1.2 

38.7 

27.0 

23.0 

31.0 

24.0 

27.0 

Chi-square  1 

10 

40.3 

2.5 

23.0 

21.6 

25.3 

22.2 

36.6 

20 

63.2 

1.0 

46.9 

40.7 

51.2 

44.4 

64.1 

50 

94.4 

1.0 

87.3 

81.6 

91.3 

84.9 

96.0 

Chi-square  4 

10 

2.6 

24.8 

18.2 

16.4 

20.4 

18.4 

15.6 

20 

1.4 

42.1 

31.0 

28.9 

35.0 

30.6 

31.7 

50 

2.2 

78.0 

68.0 

63.5 

74.5 

67.3 

74.6 

Half-cauchy 

10 

53.8 

2.6 

41.8 

35.7 

44.4 

37.5 

45.4 

20 

75.6 

0.6 

65.5 

57.3 

68.4 

58.8 

68.7 

50 

95.3 

0.0 

92.5 

88.4 

94.0 

88.9 

93.9 

Half-normal 

10 

1.8 

25.0 

17.7 

17.0 

19.6 

18.8 

15.3 

20 

J.2 

34.8 

24.6 

22.5 

28.3 

24.5 

23.8 

50 

0.9 

64.9 

51.8 

47.1 

60.1 

49.8 

56.6 

Log-normaKl) 

10 

21.8 

7.2 

14.2 

13.0 

15.5 

13.8 

14.9 

20 

29. 1 

6.0 

20.7 

16.3 

22.6 

18.9 

22.2 

50 

42.4 

5.4 

32.1 

29.5 

36.4 

32.3 

35.3 

Log-norm (2. 4) 

10 

86.9 

0.4 

76.3 

71.6 

79.4 

73.4 

84.6 

20 

99.0 

0.0 

97.4 

95.4 

98.3 

95.8 

99.1 

50 

100.0 

0.1 

100.0 

100.0 

100.0 

100.0 

100.0 

Uniform 

10 

3.2 

48.5 

37.9 

43.0 

45.5 

43.6 

39.0 

20 

15.4 

78.0 

66.7 

75.8 

78.0 

72.8 

74.9 

50 

83.0 

98.9 

96.8 

99.7 

99.6 

98.3 

99.6 

Weibull  (0.5) 

10 

68.4 

0.5 

52.5 

45.2 

55.4 

48.2 

66.4 

20 

92.0 

0.0 

85.3 

77.2 

88.0 

79.8 

92.8 

50 

100.0 

0.4 

100. 0 

99.6 

100.0 

99.8 

100.0 

1.8 

0.1 

9.3 


TABLE  7 


Power  Comparisons  using  Method  2 
(using  unbiased  estimates  of  A  and  B) 


The  table  shews  the  percentage  of  2500  samples  declared  significant 
by  Che  statistic,  when  Che  test  used  was  a  lOZ  test. 


Alternative  S 

imple 

+ 

2 

2 

2 

distr. 

size 

D 

n" 

D 

V 

W 

U 

A 

Bet.i  (1,4) 

10 

4.3 

16.0 

11.8 

11.6 

13.7 

11.9 

12.9 

20 

3.1 

22.0 

15.1 

15.6 

17.8 

14.4 

16.4 

50 

1.0 

38.6 

26.9 

22.8 

32.1 

24.7 

29.1 

Chi-square  1 

10 

39.8 

2.4 

23.5 

22.1 

26.8 

25.4 

31.6 

20 

64.3 

0.9 

48.1 

42.8 

52.4 

46.1 

59.1 

50 

93.8 

1.0 

97.3 

81.5 

91.6 

85.7 

95.0 

Chi-square  4 

10 

2.5 

23.6 

16.4 

16.1 

20.0 

16.7 

18.5 

20 

1.6 

41.6 

31.1 

29.1 

35.5 

29.7 

33.9 

50 

1.7 

77.9 

67.7 

63.3 

75.7 

67.8 

76.0 

Half-cauchy 

10 

53.4 

2.4 

42.0 

36.0 

45.9 

40.6 

48.0 

20 

76.3 

0.6 

66.1 

53.8 

69.1 

61.2 

70.3 

50 

95.1 

0.0 

92.5 

88.4 

94.3 

89.6 

94.2 

Half-normal 

10 

1.5 

23.9 

16.7 

16.5 

19.4 

17.0 

17.8 

20 

1.4 

34.2 

24.8 

23.9 

28.6 

23.2 

26.2 

50 

0.7 

64.8 

51.5 

46.7 

61.2 

50.3 

59.2 

Log-normal(l) 

10 

20.3 

6.7 

14.4 

13.0 

16.1 

14.5 

17.0 

20 

29.8 

5.9 

21.3 

17.9 

23.3 

19.4 

24.1 

SO 

40.6 

5.4 

32.1 

29.3 

37.2 

33.1 

37.4 

Log-norn(2.4) 

10 

86.6 

0.3 

76.9 

71.9 

80.4 

77.5 

82.9 

20 

99.0 

0.0 

97.5 

95.8 

98.5 

96.5 

99.0 

SO 

100.0 

O.l 

100.0 

100.0 

100.0 

100.0 

100.0 

Uniform 

10 

2.5 

47.2 

35.6 

42.6 

45.0 

40.4 

44.8 

20 

16.7 

77.6 

66.8 

77.4 

78.4 

71.2 

79.2 

50 

80.9 

98.9 

96.8 

99.7 

99.6 

98.3 

99.7 

Weibull  (O.S) 

10 

68.0 

0.5 

53.2 

45.8 

57.3 

53.2 

62.1 

20 

92.5 

0.0 

86.0 

78.6 

88.5 

82.4 

91.0 

50 

100.0 

0.4 

100.0 

99.5 

100.0 

99.8 

100.0 

Weibull  (2.0)  10 


0.8  -'.l.ft  31.4  30.8  38.4  31.8  36.3 


I '  » 

I.’,' 


F* 


TABLE  8 

Power  Comparisons:  regression  and  EDF  statistics 

The  table  shows  the  percentage  of  2500  samples  declared  significant 
by  the  statistic,  when  the  test  used  was  a  lOZ  test. 


Alternative  Sample 


distr. 

size 

r 

■f. 

U 

A^ 

m 

h 

E 

Beta  (1,4) 

10 

11.4 

8.6 

11.4 

13.7 

12.9 

20 

11.2 

5.6 

17.3 

17.8 

16.4 

50 

22.2 

(>.  7 

43.5 

32.1 

29.1 

Chi-square  1 

10 

M.4 

18.4 

25.0 

26.8 

31.6 

20 

17.8 

23.0 

39.0 

52.4 

59.1 

50 

29.4 

36.5 

78.6 

91  .6 

95.0 

Chi-square  4 

10 

13.5 

9.8 

16.0 

20.0 

18.5 

20 

13.2 

7.5 

29.2 

35.5 

33.9 

50 

19.4 

9.4 

72.7 

75.7 

76.0 

Half-cauchy 

10 

36.8 

44.1 

48.3 

45.9 

48.0 

20 

63.0 

68.1 

71.6 

69.1 

70.3 

50 

92.0 

93.8 

96.0 

94.3 

94.2 

Half-normal 

10 

15.8 

10.2 

17.0 

19.4 

17.8 

20 

19.3 

9.3 

29.6 

28.6 

26.2 

50 

32.2 

12.6 

72.4 

61.2 

59.2 

Log-normal(l) 

10 

14.4 

19.0 

19.1 

16.1 

17.0 

20 

24.4 

29.5 

26.5 

23.3 

24.1 

50 

42.8 

49.4 

47.4 

37.2 

37.4 

Log-norm  (2.4) 

10 

54.2 

64.4 

76.4 

80.4 

82.9 

20 

81.5 

86.6 

96.0 

98.5 

99.0 

50 

98.8 

99.1 

100.0 

100.0 

100.0 

Uniform 

10 

56.0 

40.2 

48.0 

45.0 

44.8 

20 

88.7 

71.3 

82.6 

78.4 

79.2 

50 

100.0 

99.5 

99.8 

99.6 

99.7 

Welbull  (0.5) 

10 

30.3 

40.6 

54.4 

57.3 

62.1 

20 

49.8 

58.4 

82.0 

88.5 

91.0 

50 

82.5 

85.8 

99.4 

100.0 

100.0 

Uelbull  (2.0) 

10 

27.4 

18.8 

37.8 

38.4 

36.3 

20 

38.8 

22.0 

73.3 

74.6 

73.2 

50 

71.6 

48.0 

99.7 

99.4 

99.4 
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ABSTRACT 


TESTS  FOR  EXPONENTIALITY  WHEN  ORIGIN  AND  SCALE 
PARAMETERS  ARE  UNKNOWN 
By 

John  Splnelll  and 
Michael  A.  Stephens 


nxcnaex  a.  ocepnens  ^  ^ 

Several  methods  are  examined  for  testing  for  the  exponential 
distribution,  when  both  the  scale  and  location  parameters  are  un¬ 
known.  Percentage  points  are  given  for  EDF^tests.  In  which  both 
parameters  are  estimated  from  the  data,  and  the  tests  are  compared  for 
power  with  EDP  tests  where  only  the  scale  parameter  Is  unknown.  In 
addition,  power  studies  are  reported  to  compare  these  tests  with  other 
goodnesa-of-flt  techniques  for  the  exponential  distribution, 
particularly  those  based  on  regression  methods,  and  it  Is  found  that 
the  EDP  statistics  and  A^  have  overall  good  power  properties. 
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